Forming intracluster gas in a galaxy 
protocluster at a redshift of 2.16 


Luca Di Mascolo’”? =, Alexandro Saro’?*4, Tony Mroczkowski°, Stefano Borgani’”*“, Eugene Churazov®’, Elena Rasia?’, Paolo 
Tozzi, Helmut Dannerbauer?”, Kaustuv Basu", Christopher L. Carilli”, Michele Ginolfi>*, George Miley, Mario Nonino?, 
Maurilio Pannella’, Laura Pentericci!®, Francesca Rizzo!” 


1 Astronomy Unit, Department of Physics, University of Trieste, via Tiepolo 11, 34131 Trieste, Italy ? INAF - Osservatorio Astronomico di Trieste, via Tiepolo 11, 34131 Trieste, Italy 3 IFPU - 
Institute for Fundamental Physics of the Universe, Via Beirut 2, 34014 Trieste, Italy 4 INEN - Sezione di Trieste, Via Valerio 2, 34127 Trieste, Italy 5 European Southern Observatory (ESO), Karl- 
Schwarzschild-Strasse 2, 85748 Garching, Germany © Max-Planck-Institut fiir Astrophysik (MPA), Karl-Schwarzschild-Strasse 1, 85741 Garching, Germany ” Space Research Institute, Profsoyuznaya 
str. 84/32, 117997 Moscow, Russia 8 INAF - Osservatorio Astrofisico di Arcetri, Largo E. Fermi, 50122 Firenze, Italy 9 Instituto de Astrofisica de Canarias (IAC), 38205 La Laguna, Tenerife, Spain 10 
Universidad de La Laguna, Dpto. Astrofísica, 38206 La Laguna, Tenerife, Spain 1! Argelander Institute for Astronomy, University of Bonn, Auf dem Hügel 71, 53121 Bonn, Germany 1? National 
Radio Astronomy Observatory, P.O. Box 0, 87801 Socorro, USA 13 Dipartimento di Fisica e Astronomia, Università di Firenze, Via G. Sansone 1, 50019, Sesto Fiorentino (Firenze), Italy 14 Leiden 
Observatory, Leiden University, PO Box 9513, 2300 RA Leiden, The Netherlands 15 INAF - Osservatorio Astronomico di Roma, Via Frascati 33, 1-00040 Monteporzio (RM), Italy 16 Cosmic Dawn 
Center (DAWN), Jagtvej 128, 2200 Copenhagen N, Denmark 17 Niels Bohr Institute, University of Copenhagen, Jagtvej 128, 2200 Copenhagen N, Denmark 


= email: luca.dimascolo@units.it 


Galaxy clusters are the most massive gravitationally bound structures in the Universe, comprising thousands of galaxies and 
pervaded by a diffuse, hot “intracluster medium” (ICM) that dominates the baryonic content of these systems. The formation 
and evolution of the ICM across cosmic time! is thought to be driven by the continuous accretion of matter from the large-scale 
filamentary surroundings and dramatic merger events with other clusters or groups. Until now, however, direct observations of 
the intracluster gas have been limited only to mature clusters in the latter three-quarters of the history of the Universe, and we 
have been lacking a direct view of the hot, thermalized cluster atmosphere at the epoch when the first massive clusters formed. 
Here we report the detection (about 60) of the thermal Sunyaev-Zeldovich (SZ) effect? in the direction of a protocluster. In fact, 
the SZ signal reveals the ICM thermal energy in a way that is insensitive to cosmological dimming, making it ideal for tracing 
the thermal history of cosmic structures’. This result indicates the presence of a nascent ICM within the Spiderweb protocluster 
at redshift z = 2.156, around 10 billion years ago. The amplitude and morphology of the detected signal show that the SZ 
effect from the protocluster is lower than expected from dynamical considerations and comparable with that of lower-redshift 


group-scale systems, consistent with expectations for a dynamically active progenitor of a local galaxy cluster. 


To measure the Sunyaev-Zeldovich (SZ) effect of the protoclus- 
ter complex surrounding PKS 1138-262 (z = 2.156; commonly 
known as the Spiderweb galaxy), we used the the Atacama 
Large Millimeter/Submillimeter Array and obtained deep Band 3 
(94.5 — 110.5 GHz) observations, exploiting both the 12-meter ar- 
ray (ALMA) and the 7-meter Atacama Compact Array (ACA). 
The transition from sparse protocluster complexes to mature, 
nearly virialized systems is a tumultuous process’”. Energetic 
events — for example, infall and accretion of the diffuse medium 
from surrounding filaments, mergers with substructures and 
feedback from active galactic nuclei (AGN) — affect the regu- 
larity of the assembling intracluster medium (ICM), with simu- 
lations predicting that their effects could persist for more than a 
Hubble time’. From an observational point of view, this implies 
that building a simple, analytical model describing the morphol- 
ogy of the disturbed proto-ICM is not a trivial task. Further, 
the possibility of mapping the ICM within galaxy protoclus- 
ters relies on our ability to separate the SZ footprint from any 
contaminating sources within the field. In fact, the Spiderweb 
galaxy harbours a powerful AGN®?, with associated hybrid- 
morphology!!! jets extending over scales of about 100 kpc. 
The result is that the millimetre-wavelength continuum signal 
in the direction of the Spiderweb protocluster is dominated by 
the emission from the central radio galaxy. In the available 
Band 3 ACA data (probing the SZ signal thanks to its ability to 
recover larger angular scales), we measure a peak surface bright- 
ness for the continuum emission from the central radio source 
of 2.39 + 0.12 mJy beam™!. By assuming that, also at the Spi- 
derweb redshift, the total mass and volume-integrated SZ signal 
scale covariantly (as established theoretically and empirically for 


low-z clusters; see ref. for a review), we find that such a flux 
estimate is at least an order of magnitude larger than the absolute 
amplitude of the peak SZ signal expected for the Spiderweb pro- 
tocluster (< 0.2 mJy beam) computed by considering an upper 
limit of about 104 Mo for the mass of the system’? (with Mo 
denoting the mass of the Sun). This large difference in the dy- 
namic range of the surface brightness of the extended structure 
of the Spiderweb galaxy and of the underlying SZ effect limits 
the possibility of performing a robust separation of the two sig- 
nals through standard imaging techniques. Thus, to handle the 
above complexities and obtain a statistically robust detection of 
the SZ signal, we need to rely on simplifying assumptions. We 
thus assume that the SZ signal is generated by a spherically sym- 
metric ICM distribution and that the extended radio source can 
be described by a collection of point-like components (Methods). 
We then analysed the available ALMA data using a Bayesian for- 
ward modelling approach!®. The inclusion of a model compo- 
nent for the SZ signal is favoured by the Bayesian evidence (that 
is, the normalization factor in Bayes’ theorem, given by the like- 
lihood marginalized over the prior volume and key element for 
performing Bayesian model selection) over the case comprising 
only the jet and AGN emission (Fig. 1) at an effective significance 
Geff = 5.97 + 0.08. This is estimated from the difference A log Z of 
the logarithm of the evidences (hereafter, log-evidence) for the 
models with and without a SZ component and by assuming the 
posterior distribution to be described by a multivariate normal 
distribution (thatis, Ce = sgn(A log Z): y2 |A log ZI). We further 
note that the reported value represents a conservative lower limit 
on the actual significance of the SZ signal, as different prescrip- 
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Fig. 1 | Binned uv profiles of the Band 3 ALMA and ACA data. a. Comparison with the radio source model. The solid lines represent the corresponding median uv profile 
for each visibility set obtained by marginalizing over different numbers of point-like components (see Methods). The model uncertainties are computed accordingly but, 
despite being plotted, are too small to be visible. The shaded regions denote the standard deviation of the azimuthal variation of the model amplitude in each uv bin, 
owing to the elongated morphology of the radio galaxy and the asymmetric visibility patterns. The systematic drop in the real component of the visibilities at short uv 
distances (that is, large angular scales) provides evidence for the presence of a SZ decrement towards the Spiderweb complex. We note that, as we do not see a similar 
deviation in the imaginary component, this cannot be ascribed merely to phase variations owing to off-centre sources. b. Comparison of SZ models with the ALMA 
and ACA data. Shown are the worst (A10 UP; ref.*) and best (L15 8.0; ref.°) SZ profiles, intended as the models that are statistically favoured the least and the most, 
respectively, based on their Bayesian evidence. Before binning the data, we subtracted the median radio model from the visibilities and shifted the phase centre on the 
SZ centroid. The uncertainties associated to the radio model and to the SZ coordinates are propagated into the ones for the ALMA and ACA data points. The shaded 
regions correspond to the 68% credible intervals for each model. Also in this case, the uncertainties and median SZ profiles are marginalized over models with varying 
numbers of point-like components. The divergence of the two models and the increase of their uncertainties at small uv distances (that is, large scales) is symptomatic of 


the limited capabilities of ALMA+ACA in constraining fluxes above about 1.70’. For both panels, the error bars denote 1o uncertainties. 


tions for the pressure distribution of the ICM further improve 
the Bayesian evidence (see Extended Data Table 2 in Methods). 
Given the above self-similar assumption for the pressure distri- 
bution, we estimate that the detected SZ signal would correspond 
to a halo with Mso = (3.461938) x 10'S Mo and rsoo = 228.984 kpc, 
where Mso is the total mass contained in the spherical volume of 
radius rs99 within which the average density is 500 times larger 
than the critical density of the Universe at the source redshift 
(here, as in the rest of the manuscript, the best-fit value and the 
uncertainties correspond to the 50", and 16" and 84" percentiles 
of the posterior distribution function for a given model param- 
eter, respectively). At face value, this mass constraint is much 
lower than the naive (and largely uncertain) expectations based 
on velocity-dispersion measurements previously reported in the 
literature!31617, We find that considering different assumptions 
for the ICM pressure distributions slightly relieves this tension 
but produce mass estimates still smaller than expected from dy- 
namical considerations. However, comparing our estimate of 
the volume-integrated SZ signal with the corresponding value 
predicted from dynamical mass estimates provides an empirical 
demonstration that the detected ICM halo is probably part of 
an extended complex of several interacting substructures. Simi- 
lar hints are observed when repeating our analysis by including 
several SZ components to the overall model, without however 
providing conclusive and statistically meaningful results. The 
observed discrepancy between the SZ mass estimate and the 
value from velocity-dispersion measurements, in combination 
with the evidence for a positional offset between the Spiderweb 
galaxy and the SZ centroid is thus consistent with a merging sce- 
nario, in which the complex structure of the pressure distribution 
of the proto-ICM is not necessarily well captured by the simple 
analytical models used in our analysis. A thorough discussion 
of the potential explanations of such discrepancy, together with 


the assessment of potential systematic issues, can be found in 
Methods. 

To circumvent the limitations of these analytic results and 
corroborate our measurements with a more realistic and physi- 
cally complex model, we perform a comparison of the SZ signal 
that we reconstructed from the ALMA+ACA data with mock 
SZ observations based on cosmological hydrodynamical simu- 
lations!?”° of galaxy protoclusters (Methods). We emphasize 
that the scope of this comparison is not aimed at identifying an 
exact simulation counterpart to the Spiderweb protocluster but 
rather at guiding our interpretation of the measured SZ effect by 
providing quantitative predictions on the overall SZ signal ex- 
pected to be measurable in the observations. We generate mock 
ALMA+ACA observations for a set of 27 simulated massive ha- 
los (Mso 2 1.3 x 10°Mo at z = 2.16) that represent progenitors 
of galaxy clusters with masses at redshift z = 0 in the range 
Mso = (5.6 — 8.8) x 10'* Mo h™!. In Fig. 2, the resulting uv profiles 
for all the simulated clusters are compared with the radio-source- 
subtracted ALMA+ACA data (as shown in Fig. 1). These show 
an agreement between the observed SZ signal, the prediction for 
halos of mass Mso = (2—5) x 10'S Mo, and, in turn, the indepen- 
dent estimates of the protocluster mass from the parametric mod- 
elling. Such a result provides a straightforward assessment of the 
reliability of the above results and, therefore, of the mismatch of 
the amplitude of the observed SZ signal with the expectations 
from dynamical considerations. 

The analysis described above required a reduction of the com- 
putational complexity by limiting the range of uv scales used 
for the model reconstruction. To obtain a high-resolution view 
of the Spiderweb complex and fully exploit the entirety of the 
dynamic range of physical scales investigated by ALMA+ACA, 
we apply an independent, sparse modelling approach to image 
the available data (see Methods for details). The resulting high- 
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Fig. 2 | uv profiles from mock SZ observations of simulated galaxy protoclusters. 
Each curve represents the uv profile obtained for a halo extracted from the Dianoga 
set after projecting the corresponding SZ map onto the visibility plane, colour- 
coded according to their mass. For a comparison, we further plot the binned ACA 
and ALMA measurements from Extended Data Fig. 6. Consistent with our finding 
from the parametric modelling, the observed SZ signal suggests that the Spiderweb 
protocluster is characterized by a mass Mso = (2 — 5) x 10! Mo. We note that this 
result is only weakly dependent on the cosmological model adopted in the reference 
simulation. As for Fig. 1, the error bars correspond to the 1ø uncertainties on the 
binned ALMA and ACA data. 


resolution image of the SZ signal obtained after subtracting the 
best-fit radio source from the ALMA+ACA data is provided in 
Fig. 3, along with a multiwavelength perspective on the Spider- 
web complex (see also Extended Data Fig. 6 in Methods). Over- 
all, the inclusion of this information in the context of the wealth 
of multiwavelength data associated with the Spiderweb system 
strongly supports an extremely dynamically active phase of pro- 
tocluster formation (Methods). The multiphase environment is 
in fact experiencing complex interactions with the extended radio 
galaxy and mass accretion from the large-scale cosmic web and 
energetic merging events”, consistent with the considerations 
from our parametric analysis. 


Overall, the reported identification in the ALMA+ACA obser- 
vations of the SZ signal from the central region of the Spiderweb 
complex is providing the direct indication that the system is 
more than a loose association of galaxies and has already started 
assembling its own halo of diffuse intracluster gas. Most impor- 
tantly, this is providing a statistically meaningful confirmation 
of long-standing predictions from cosmological simulations’ for 
the existence of an extended halo of thermalizing ICM within the 
Spiderweb protocluster, as well as of observational works, so far 
limited just to indirect!” evidence or tentative detections’. 
At the same time, this detection shows that current SZ facilities 
could be used to effectively open a new observational window 
on protocluster environments. Many such systems, including 
more massive protoclusters, are in fact expected® to exist out to 
z ~ 3. To this end, the SZ effect provides the means for obtaining 
a straightforward, unambiguous confirmation of the probable 
progenitors of local galaxy groups and clusters, as well as of the 
continuing thermalization of the forming ICM in these systems 
or any of their parts”. In fact, even in the case of the extensively 
studied Spiderweb protocluster — for which we have spectro- 


scopic confirmation for 112 member galaxies across the entire 
protocluster structure” — a robust identification of distinct sub- 
halos through a spectroscopic characterization has so far been 
ineffective. Similarly, obtaining a robust identification of a ther- 
mal ICM component with current X-ray facilities would require 
a prohibitive amount of observing time, as well as an accurate 
separation of thermal, inverse Compton and AGN contributions 
to the overall X-ray signal™*589. We refer, for instance, to the 
recent Chandra study” of the Spiderweb protocluster (see also 
Extended Data Fig. 6 in Methods), which, despite being based 
on observations with an exposure time more than an order of 
magnitude larger than the ALMA+ACA measurements used in 
this work, provided tentative but inconclusive support to the 
thermal origin of a diffuse component in the X-ray emission. 
Identifying and performing detailed characterization of many 
more of the first-forming clusters during the crucial phase of 
vigorous relaxation and thermalization will be essential for gain- 
ing a comprehensive view of the emergence of galaxy clusters 
from the large-scale structure of the Universe, as well as the en- 
vironmental processing of galaxies in the earliest phases of their 
evolution. Similarly, it will help shed light on the role of feed- 
back mechanisms in determining the physical properties of the 
diffuse baryons in the proto-ICM at the very epochs when their 
activity is expected to peak and to have a key impact on galaxy 
and protocluster formation. 
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Fig. 3 | Multiwavelength view of the Spiderweb complex. Composite Hubble Space 
Telescope image based on the ACS/WFC F475W and F814W data of the Spiderweb 
field. Overlaid are the emission from the Spiderweb galaxy and associated extended 
radio jet as measured by the Karl G. Jansky Very Large Array (VLA) in X-band!0™ 
(8 — 10 GHz; red), the image of the extended Lya nebula?! observed with the 
FORS] instrument on the Very Large Telescope (VLT; pink), and the SZ signal from 
acombined ALMA+ACA image (light blue). When imaging the ALMA+ACA data, 
we applied to the visibility weights a uv-taper with otaper = 20 kA to both suppress 
any noise structures on small scales and to emphasise the bulk distribution of the 
SZ signal from the ICM of the Spiderweb protocluster. The SZ effect offset with 
respect to the Spiderweb galaxy suggests that the protocluster core is undergoing 
a dynamically active phase. The white diamonds denote all the spectroscopically 
confirmed member galaxies summarized in ref.” and references therein. 
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Methods 


Cosmology 

In this work, we consider a spatially flat A Cold Dark Matter 
cosmological model, with Oy = 0.30, Q, = 0.70, and Hp = 
70.0 kms Mpc™. At the redshift of the Spiderweb complex 
(z = 2.156), 1” corresponds to 8.29 kpc. 


ALMA observations and reduction 

Anextensive observational campaign was performed during Cy- 
cle 6 to obtain a detailed view of the Spiderweb complex in 
Band 3 (project code: 2018.1.01526.S, PI: A. Saro). The data 
comprise measurements from the main 12-meter ALMA’! ar- 
ray in three different configurations (C43-1, C43-3 and C43-6), 
aimed at providing a high-dynamic-range view of the structure, 
as well as from the 7-meter ACA* (also known as Morita Ar- 
ray), complementing the ALMA observations over SZ-relevant 
scales (that is, over a uv range of 2.2 — 17.5 kA, corresponding to 
scales 767 — 97 kpc at the redshift of the Spiderweb galaxy). The 
spectral setup for all the configurations was tuned to cover the 
frequency range 94.5 — 110.5 GHz, split over four 2 — GHz-wide 
spectral bands centred at 95.5 GHz, 97.5 GHz, 107.5 GHz, and, 
109.5 GHz, respectively. In particular, the last window targets 
the line emission resulting from the J=3-2 transition from the 
carbon monoxide (CO; rest frequency 345.796 GHz). As we are 
interested in modelling only the continuum component of the ob- 
served signal, we conservatively exclude all the visibilities from 
the spectral window expected to contain the redshifted CO J=3-2 
line. In fact, excluding only the channels corresponding to the 
specific emission line would provide a slight improvement in the 
overall statistics of the analysed data. Previous studies*~ of the 
molecular content of the nuclear region around the Spiderweb 
Galaxy and of protocluster galaxies have shown, however, that 
the cold molecular gas within the Spiderweb complex is charac- 
terized by large velocity dispersion as well as broad differences 
in the systemic velocities of the member galaxies. Faint tails and 
offset components may hence contaminate the continuum signal 
in any channels in the proximity of the emission line, potentially 
affecting the model reconstruction. 

Data calibration was performed in the Common Astronomy 
Software Application (CASA; https: //casa.nrao.edu/) pack- 
age version 5.4.0 using the standard reduction pipeline provided 
as part of the data delivery. The direct inspection of output visi- 
bility tables highlighted no clear issues with the outcome of the 
pipeline calibration and we therefore did not perform any ex- 
tra flagging or postprocessing tuning. The resulting root-mean- 
square (RMS) noise levels of the observations are estimated to 
be 4.10 pJy beam, 29.1 Jy beam, and 14.2 Jy beam“ for the 
C43-1, C43-3, and C43-6 ALMA measurements, respectively, and 
32.0 Jy beam! for the ACA data. 

To obtain better knowledge of the spectral properties of the 
measured signals, we further include in our analyses archival 
Band 4 ALMA (project code: 2015.1.00851.S, principal investi- 
gator B. Emonts) and ACA (project code 2016.2.00048.S, prin- 
cipal investigator B. Emonts) observations. In both cases, we 
use the calibrated measurement sets provided by the European 
ALMA Regional Centre network?” through the calMS service®. 
The achieved RMS noise levels amount to 99.0 uJy beam™! and 
6.29 py beam“! for the ACA and ALMA data, respectively. As 
with the Band 3 measurements, we exclude from our analyses 
the spectral windows in Band 4 covering the CO J=4-3 and [C1] 
3P,- 3Py emission lines”. 


All interferometric images presented in this work are gener- 
ated using CASA package version 6.3.0. 


Nested posterior sampling 

To obtain a statistically robust detection of the potential SZ signal 
in the direction of the Spiderweb complex, we use the approach 
already used in Refs.183°4! in the context of ALMA+ACA studies 
of the SZ signal. In brief, we perform a visibility-space analysis, 
which allows for exactly accounting for the non-uniform radio- 
interferometric transfer function, as well as taking advantage of 
the Gaussian properties of noise in the native Fourier space. Any 
extended model component is created in image space, taking into 
account the proper frequency scaling and primary-beam attenu- 
ation for any fields and spectral windows used in the analysis, 
and is then projected onto the visibility points by means of a non- 
uniform fast Fourier transform algorithm based on convolutional 
gridding (as implemented in the finufft library; ref.””). Instead, 
in Fourier space, point-like sources are trivially represented by 
a constant function with amplitude equal to the source flux cor- 
rected for the primary-beam attenuation at the source position 
and with a phase term defined by the offset between the point 
source and the phase centre of the observations. To allow for 
Bayesian model selection and averaging, the sampling of the pos- 
terior distribution is performed by means of the nested sampling 
algorithm". We specifically exploit the implementation pro- 
vided in the dynesty (ref.*°) package, which allows for robustly 
extending the sampling problem to moderate-dimensional and 
high-dimensional models. For any further details on the model 
reconstruction, we refer to the discussion provided by Di Mas- 
colo et al. in Refs.'°°? 

Obtaining a thorough description of the small-scale, com- 
plex morphology of the extended radio signal from the Spider- 
web galaxy would require performing a pixel-level model infer- 
ence (see “Sparse imaging” section below), resulting in a poste- 
rior probability function with extreme dimensionality. Nested 
sampling techniques generally show better performances than 
Monte Carlo Markov Chain algorithms in the case of moderate- 
dimensional problems (mitigating the impacts of the so-called 
“curse of dimensionality”““°). Still, sampling from high- 
dimensional posterior distributions may easily become compu- 
tationally intractable owing to the complexity of estimating high- 
dimensional marginal likelihoods. Therefore, we decide to per- 
form a first modelling run only on a large-scale subset of the 
available data. The giant Lya nebula observed to surround the 
Spiderweb galaxy is in fact expected to be confined within a dif- 
fuse halo of hot intracluster gas!°1'4”_ In turn, the SZ footprint 
of potential intracluster gas within the Spiderweb protocluster 
should be expected to extend over characteristic scales = 10”. 
We therefore introduce an upper cut in the visibility space at a uv 
distance of 65 kA, whose corresponding angular scale is roughly 
twice the transverse width of the jet structure (that is, the size 
measured along the direction perpendicular to the jet direction) 
observed when imaging only the high-resolution ALMA data 
from the C43-3 and C43-6 observations. Such a choice makes 
the jet signal spatially resolved only along the jet axis and allows 
for describing this as a limited collection of point-like sources. 
As a result, the number of parameters required to model the 
observations remains limited and the analysis computationally 
manageable, while allowing for using nested sampling to obtain 
statistically meaningful information on our model inference. 

The selection of the specific number of compact components 
required to model the radio source was performed by means 
of Bayesian model selection. In particular, we consider as the 


most favoured model set the one for which the introduction 
of an extra point-like term would have caused a degradation 
or only a marginal improvement in the log-evidence™ (that is, 
(log Zn+1ı — log Zn) < 0.50, where n denotes the number of model 
components). For all of the components, as already described 
above, we assume the spatial morphology to be described by a 
Dirac-6 function (that is, a constant function in Fourier space with 
non-null phase term) and the source fluxes by a power-law spec- 
tral scaling. The priors for all the parameters — right ascension, 
declination, flux and spectral index — are described by uniform 
probability distributions. In particular, the right ascension and 
declination are allowed to vary within the area of r ~ 1’ enclosed 
within the first null of the antenna pattern for the Band 3 ALMA 
data at the highest available frequency (that is, 107.5 GHz). To 
avoid label switching and force mode identifiability, we impose 
an further ordering prescription”! to the right ascension parame- 
ters. The flux and spectral index parameters are instead allowed 
to vary within uniform prior probability distributions. In par- 
ticular, we constrain the source fluxes to be non-negative and 
assume an upper limit of 10 mJy on each amplitude, around an 
order of magnitude larger than the emission peak in the ALMA 
map (1.54 mJy; see top panel of Extended Data Fig. 1). For the 
spectral indices, we consider a range [—5,10], arbitrarily wide 
and set to cover both the cases of power spectra with negative 
and positive slopes, consistent with synchrotron-like and dust- 
like spectral properties, respectively. The prior limits are inten- 
tionally set to extend well beyond the values expected for such 
cases, to avoid overconstraining of the source spectral properties 
while allowing for a quick diagnostics of the actual constraining 
power of the available data with respect to spectral information. 


To describe the pressure distribution of the ICM, we instead use 
a generalized Navarro-Frenk-White (gNFW) profile’, widely 
shown to provide an accurate description of the average pres- 
sure distribution of the intracluster gas. In particular, we use 
different gNFW formulations from the literature. A summary is 
provided below. 


- The universal profile (hereafter, A10 UP) by Arnaud et al.‘ 
derived from the reconstruction of the pressure distribution in 
galaxy clusters from the REXCESS® sample and the equivalent 
model obtained from the subset of systems with clear evidence 
of a disturbed morphology (A10 MD). Although this profile is 
calibrated on massive local systems (10! Mo < Mso < 10° Mo, 
z < 0.2), itis the base model used for the mass reconstruction in 
large-scale SZ surveys and allows for a straightforward com- 
parison with the literature. 


- The pressure model™ (M14 UP) obtained from the X-ray anal- 
ysis of a high-redshift (0.6 < z < 1.2) subsample of galaxy 
clusters detected by the South Pole Telescope” (SPT). As 
for the previous case, we further consider the pressure profile 
(M14 NCC) reconstructed by excluding all the systems with 
clear evidence for the presence of a cool core, generally indica- 
tive of amore relaxed dynamical state. To our knowledge, this 
model represents the highest-redshift, observationally moti- 
vated pressure profile available at present, in turn potentially 
providing a better description of the pressure distribution in 
the Spiderweb system than the A10 models. 


- The median pressure profiles reconstructed from the Over- 
Whelmingly Large Simulations (OWLS) suite of cosmological 
hydrodynamical simulations, cosmo-OWLS, and considering 
different prescriptions for the physical model of AGN-driven 
heating of the intracluster gas. In particular, we consider the 
OWLS*® reference model (L15 REF), as well as the two AGN 


models (L15 8.0 and L15 8.5; we refer to Le Brun et al.° for de- 
tails). These models present two main advantages. First, they 
are built on simulated halos whose mass and redshift ranges 
(2x 10" Mo < Mso < 3 x 10° Mo, 0 < z < 3) broadly overlap 
with the properties of the Spiderweb protocluster. Second, at 
z ~ 2, the Spiderweb complex sits in a transitional phase for 
AGN feedback*”’, and the different flavours of the cosmo- 
OWLS pressure model allow for directly using the SZ effect to 
explore different AGN scenarios. 


- The mass-dependent and redshift-dependent extended pres- 
sure model“! (G17 EXT) calibrated on simulated galaxy clusters 
from the set of Magneticum Pathfinder hydrodynamical simu- 
lations (http: //www.magneticum.org/). Although this profile 
is computed on massive galaxy clusters (Mso > 1.4 10'* Mo), 
it provides an explicit model for taking into account the depar- 
ture from universality and self-similarity as a function of mass 
and redshift. 


In all cases, the free parameters defining the gNFW profiles are 
the plane-of-sky coordinates of the model centroid and the mass 
parameter Mso9. As for the radio model, the right ascension and 
declination parameters are prescribed to vary within the region 
encompassed by the first null of the Band 3 ALMA primary beam. 
For the mass M500, we consider a log-uniform distribution over 
the range [10!* Mo, 10'° Mol], in order to facilitate the posterior 
exploration over such a wide range of order of magnitudes. 

We also tried fitting the cool-core versions of the A10 and M14 
profiles above but found these to be systematically disfavoured 
(Alog Z < 15.5) in comparison with the listed models. This is, 
however, not surprising, as the presence of a well-formed cool 
core would be hardly consistent with the inherently disturbed 
nature of a protocluster complex. 

Finally, we account for any potential systematics with data cal- 
ibration by considering a scaling parameter for each of the mea- 
surement sets used in our analysis. For these, we assume normal 
prior distributions, with unitary central value and standard de- 
viation of 5%, as reported in the ALMA Technical Handbook for 
the considered observing cycles. 


Results. With regards to the extended signal from the Spider- 
web galaxy, the criterion introduced above for the selection of 
the number of point-like components supports the case for a to- 
tal of eight distinct components over the entire search area (we 
summarize the key information on the results of the point-like 
modelling in Extended Data Table 1). In particular, two compo- 
nents (ID1 and ID2) are found to be spatially consistent with the 
position of known protocluster members?”343549.62-4 — that is, 
ERO 284 (ref.'*) and HAE 229 (refs. ), respectively — located 
around 250 kpc west of the Spiderweb galaxy. The remaining 
components instead uniquely describe the radio emission asso- 
ciated with the Spiderweb galaxy, with one point-like component 
(ID8) specifically corresponding to the bright lobe of the eastern 
radio jet and another (ID6) being nearly coincident with the Spi- 
derweb galaxy itself (Extended Data Fig. 1). All the components 
describing the extended signal exhibit a negative spectral index, 
consistent with the synchrotron origin of the emission. The best- 
fit estimates highlight a spatial variation consistent with what is 
observed at lower frequencies in the Karl G. Jansky Very Large 
Array (VLA) data®™, which show that the spectrum of the east- 
ern lobe is, on average, steeper than that of the Spiderweb galaxy 
and the western jet. The specific values are also in rough agree- 
ment with the results from the VLA analyses. A one-to-one 
comparison is, however, not practicable, owing to the inherent 
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Extended Data Fig. 1 | Images from the nested sampling reconstruction. Raw im- 
age (top) from the Band 3 ALMA+ACA data and residuals (bottom) after subtraction 
of the marginalized model for the central radio source obtained from the nested 
sampling. We stress that we are not subtracting the model component for the SZ 
effect when generating the residual image but the limited sensitivity is making 
any SZ features non-obvious. Because our analyses are performed in the native 
visibility space, we do not perform any cleaning step. The solid and dashed white 
contours in the top figure trace the data image and the radio-source model, respec- 
tively. As evident from the comparison of the two sets of contours, the model map 
is hardly distinguishable from the data. The circles in the top panel denote the 
position of the point-like model components used to described the extended signal 
from the Spiderweb galaxy and the continuum emission from the western proto- 
cluster members. We label the point-like sources that are observed to correspond 
to distinct features or known astrophysical sources (see discussion in “Results” or 
Extended Data Table 1). The black contours overlaid on the residual map denote 
the reference A10 UP model for the SZ signal. The SZ component manifests a 
clear shift with respect to the Spiderweb galaxy, consistent with the result from the 
independent sparse-imaging analysis (Extended Data Fig. 6). Consistent with the 
nested sampling analysis, the above images are produced after applying an upper 
uv cut of 65kA to the input data. As for Extended Data Fig. 6, the triangle marks the 
position of the Spiderweb galaxy. All the contour levels are arbitrary and chosen 
with the sole intent of optimally marking the position of the best-fit SZ model. 


high-frequency spectral steepening induced by radiative losses 
and the different modelling approaches. The two offset sources, 
on the other hand, are both characterized by positive spectral 
indices. Such a trend is possibly indicating a dominant contri- 
bution from thermal dust emission already at about 100 GHz, 
and is in agreement with the potential presence of massive dust 
reservoirs in the galaxies, as already verified for HAE 229°. 


A summary of the inferred parameters for the different SZ 
models is instead provided in Extended Data Table 2. Despite 
the more or less substantial differences in the reconstructed pa- 


rameters, all the adopted pressure profiles resulted in statistically 
consistent SZ models, not allowing a statistically motivated se- 
lection of a specific description. All the assumed models, with 
exception of the L15 8.0 and L15 8.5 cases, provide mass estimates 
Mso ~ 3 x 10” Mo. In fact, we note that the L15 8.5 profile from 
Le Brun et al.° results in a mass ~ 8 x 10° Mo, consistent with 
many of the dynamical estimates reported in the literature!*'%!” 
for the Spiderweb protocluster. Considering that the profile is 
based on simulations with the prescription for an intense heating 
of the ICM owing to AGN feedback, the result might hint to a 
crucial role of the active core of the Spiderweb galaxy in shap- 
ing the intracluster/circumgalactic medium. Nevertheless, this 
model is statistically equivalent to many others in our sample, 
limiting the statistical relevance of the above considerations. For 
the same reason, any attempts of performing a comparison with 
past ICM studies of similarly high-redshift systems — for exam- 
ple, XLSSC 122 (ref.°”) or C1J1449+0856 (ref.°°), the clusters at the 
highest redshift known to date with a direct SZ measurement — 
would not be statistically meaningful. 

Finally, we note that, in our analyses, all the scaling parameters 
are found to be broadly consistent with unity. In particular, the 
scaling factors are measured to be equal to 1.04196 and 1.01*903 
for the Band 3 ACA and ALMA observations, respectively, while, 
in the case of the Band 4 data, we find parameters of 1.001998 for 
ACA and 1.03*}4t for ALMA (the reported values are given by 
the Bayesian Model Averages for all models considered in this 
work; see the “Dependence of the SZ significance on the number 
of point-like components” section below for a discussion). 


Comparison with masses from previous studies. Performing 
a proper comparison of the SZ-derived mass estimates (Extended 
Data Table 2) with independent measurements from the litera- 
ture is non-trivial. The dynamical masses for the Spiderweb 
protocluster are in fact based on velocity-dispersion estimates 
that might trace specific, yet not well-identified substructures 
and that span almost an order of magnitude: from 204 km s~! 
for one peak in the velocity distribution of Lya emitters in the 
Spiderweb field’ and up to 1360 km s7! as measured for all satel- 
lites within 60 kpc from the Spiderweb galaxy’®. Nevertheless, 
the SZ-based mass estimates presented in this work are much 
lower (a factor of about 2 — 4, depending on the model) than the 
dynamical values'*°’” found in the literature for the mass of the 
whole protocluster structure. In fact, the same studies reported 
evidence for a complex velocity structure within the central re- 
gion of the Spiderweb complex, hinting at the possibility for the 
system to be experiencing a major merger and still accreting large 
amounts of material from surrounding filaments. The fact that 
the Spiderweb system is embedded in a large-scale filamentary 
structure was confirmed by wide-field CO J=1-0 mapping. As 
such, even the very core of the Spiderweb protocluster might 
be not fully virialized. Accordingly, the integrated SZ signal 
Yz = Ysz(< 51599) we measure from the ALMA+ACA observa- 
tions — YA~^ = (1.68*933)x10-° Mpc? (see Extended Data Table 2) 
— is, for instance, a factor of 3.5 times lower than that expected, 
taking the mass inferred from the velocity-dispersion measure- 
ment oy = 683 kms‘! reported by Shimakawa et al.” for the 
galaxies within the 0.53 Mpc region surrounding the Spiderweb 
galaxy. 

This large difference between the expected SZ signal 
from velocity-dispersion measurements and the observed 
ALMA+ACA SZ integrated flux could be the result of a scenario 
in which the measured SZ signal is dominated by the contribution 
from the most prominent (sub)halo. In fact, the integrated SZ flux 


Extended Data Table 1 | Best-fit parameters for the model of the extended source. 


ID R.A. Dec. Fi09 GHz Spec. Index 
= = [mJy] = 

1 11440463089 + 008017 -26°29’11026 + 007124 0.0331790M3 = 3..6740.67 

2  11”40™46$656 + 008011 — -26°2910’787 + 007140 0.0208+99034  3,95+9-60 

3 11640483004 + 003005 -26°29’10542 + 00074  0.136+90%1 -2.602037 
4 11640488112 + 003004 -26°29’09'239 + 007029 -0.842#9-037  -1 496 +0.044 
5 11540488237 + 008008 — -26°29’08’649+ 00020 0.62770.018  -1.111+9:962 
6 11540485348 + 008003 -26°29’08'632+ 00007 -0.74178838  -0.635+9941 
7 11540489676 +003015 -26°2909"711 + 00239 0.0215*9.003 2.7414 

8  11%40™485924 + 008002 — -26°29’09’269 + 00017 0.178449-0033 2.05 40-13 


Note: We report here the optimal parameters obtained for the model without a SZ term. We note that the inclusion of a SZ component does not introduce any significant 
variation on the final value of the inferred parameters. A direct comparison between the position of point-like components and the actual morphology of the radio signal in 
the ALMA+ACA data is provided in Extended Data Fig. 1. As detailed in the text, several of point-like components coincide with specific compact sources. In particular, 
components ID1 and ID2 correspond to distinct protocluster members already reported in the literature?”*#59.623 _ ERO 284! and HAE 229°, respectively. The 
component ID6 is spatially consistent with the Spiderweb galaxy, whereas ID8 overlaps with the main spot of the eastern radio jet. 


Extended Data Table 2 | Comparison of the best-fit parameters for different SZ models. 


ARA. ADec. Mso 1500 Yaz(< r500) Ysz(< 5r500) Alog Z Deff 

["] [”] [10° Mo] [kpc] 10-6 Mp? 10% Mpc? — — 
A10 UP 10.3413 3.3413 3.461038 228.9184 0.854918 1.681633 17.8+0.5 5.97+0.08 
A10 MD 10.1412 3.4415 = 3.444338 228.4189 0.7610:19 1.424638 195+0.5 6.24+0.08 
M14 UP 997% 3.73 3.730488 23521 0.78*0-12 1.574038  17.9+0.5 5.98 + 0.08 
M14NCC 10.1417 3444 3.61298 232411 077R 1.101638 18.0+0.5 6.00 + 0.08 
L15 REF 10.847 244} 3.26793 = 224.4478 0.70281 LSZ 16.0 +0.5 5.66 0.09 
L15 8.0 10.7538 2.8415 5.46% 266.5428 1.461547 4.41*071 21.9+0.5 6.62 + 0.08 
L15 8.5 10.713 267 81241% 304.2133 2.10*6-48 T7438 19.7+0.5 6.28+ 0.08 
G17 EXT 9.94% 3.743 3081 220,2456 0.702913 1.151633 19.6+0.5 6.26+0.08 


Note: We refer to the nested sampling section in Methods for specific details of the models adopted in our analysis. The coordinates of the SZ components are 
reported as shifts with respect to the position of the Spiderweb galaxy (11'40™4834,-26°29’08’'55). The radii fso are computed from the reported Mso9 values as 
1500 = 3/Ms00/ [2500p-(z)1, where pe is the critical density of the Universe at redshift z. The log-evidence difference Alog Z for each of the listed models is computed 
with the respect to the case comprising only the extended radio source. The corresponding estimates for the effective significance deft are computed under the assumption 
of a multivariate normal posterior distribution (that is, deft = sgn(Alog Z) - y2 |A log ZI). 


Ys, scales steeply as a function of mass M, meaning that the SZ 
signal from a single halo would be larger than the sum of the SZ 
flux from a complex system of subhalos whose masses amount 
overall to the same value M. Under the assumptions that the 
Spiderweb protocluster is composed of several interacting sub- 
structures and the measured line-of-sight velocity dispersion o, 
is providing an unbiased estimate of the total mass of the system, 
we can exploit the Yz — M relation to obtain an estimate of the 
number halos of subhalos populating the Spiderweb complex. 


First, the dispersion øy is converted into a dynamical mass es- 
timate using the scaling relation calibrated by Saro et al. ona 
mock galaxy catalogue from the Millennium simulation”. This 
is rescaled to Mso assuming the conversion relation in Ragagnin 
et al.” between masses at different overdensities. We then iter- 
ate over halos and compute the integrated SZ flux expected for 
each set of subhalos. For the sake of simplicity, we consider all 


the nhals subcomponents to have equal mass Mao /Mpalos. Both 
the masses from our SZ analysis Maos and from the velocity- 
dispersion measurement Me /Npalos are then used to obtain a 
measurement of the spherically integrated SZ flux Y., by means 
of numerical integration of the pressure profiles over a volume 
of radius 5rsoọ. In our calculations, we consider the universal 
formulation by Arnaud et al.* to describe the pressure distri- 
bution of the intracluster electrons. Finally, to derive the num- 
ber of equal-mass subhalos within the Spiderweb protocluster 
whose individual SZ fluxes would match the measured value 


Yo“ = Ysz(Ms55*), we simply estimate the value Matos for which 
the equality Y.,(M35*) = Yo2(Me5 /Mnalos) is satisfied. (see Ex- 


tended Data Fig. 2). We note that adopting an underlying pres- 
sure model different from the A10 profile used for producing Ex- 
tended Data Fig. 2 is not causing a relevant variation in the total 
number of subcomponents Mpaios, Which are overall constrained 


ALMA+ACA 


Ysz [1076 Mpc?] 


Mhaloes 


Extended Data Fig. 2 | Integrated SZ flux as a function of the number of subhalos. 
The total dynamical mass Mz}, is here computed assuming a velocity dispersion of 


oy = 683 kms“! as reported by Shimakawa et al.!” for a 0.53 Mpc region around 
the Spiderweb galaxy. The green line and band correspond, respectively, to the 
median value and 68% credible interval of Y$;™^ estimated from the ALMA+ACA 
samples for an A10 UP pressure profile. The number of equal-mass subhalos with 


individual SZ fluxes equal to Y4™* and required to obtain a total mass equal to the 
q sz q q 


dynamical estimates is hence found to be n = 2.57 


to range between two to a maximum of four (except for the L15 
8.5 case, resulting in Mnnalos ~ 1). We further note that we made 
several attempts in using a physically motivated subhalo mass 
function (for example, the theoretical model provided by Giocoli 
et al.”) instead of our simple equal-mass distribution. However, 
we found that removing the strong constraint of having subhalos 
with the same masses makes the derivation of npalos or of the sub- 
halo mass parameters unconstrained, resulting in unstable and 
heavily degenerate results. We note, in any case, that all these 
considerations are derived a posteriori of the SZ modelling and, 
thus, do not affect the significance of the reported SZ detection. 

This emerging multihalo picture is also consistent with the 
identification reported in the literature’®” of double peaks in the 
velocity distribution. Nevertheless, we do not identify in the pos- 
terior distribution for the SZ model any notable secondary peaks 
that would be indicative of several pressure components in the 
ICM**4" in the direct surroundings of the Spiderweb galaxy (see 
however the discussion below on the results of a multicomponent 
analysis). This might be caused by the chance line-of-sight align- 
ment of separate halos, but neither the spectroscopic information 
on the protocluster members nor the ICM constraints allow us to 
disentangle any distinct contributions from superimposed sub- 
structures. Similarly, any subhalos with similar masses would 
also be characterized by comparable pressure distributions, in 
turn causing their SZ signal to be barely distinguishable. Over- 
all, the above result suggests that the SZ effect is tracing a minor 
portion of the larger Spiderweb structure in which the ICM has 
started building up and pressurizing, whereas the rest of system, 
extending over scales of tens of Mpc!*!4227,354965 and tracing the 
region encompassed by the turnaround radius of the overden- 
sity, has yet to undergo virialization. Any SZ signals associated 
with further subhalos or more extended structure are not con- 
strained by the observations, which have limited sensitivity and 
may suffer from large-scale interferometric filtering. 


Multiple SZ components The synchronous multiellipsoidal 
sampling” typical of the main nested sampling algorithms — 
and, in particular, of dynesty®”, the library employed for our 
analysis — would naturally break into separate posterior modes 
in the presence of several peaks in the posterior density func- 
tion. This would be the case, for instance, in the presence of 
several halos, resulting in distinct SZ components (as reported 
in, for example, Refs.“ in the case of merging cluster systems). 
However, as mentioned above, we do not find such evidence in 
the posterior probability distribution for any of the main mod- 


elling runs presented in this work. Nevertheless, we tested for 
the potential presence of any extra SZ features by performing a 
multicomponent analysis. In particular, we consider the same 
model description as in the single-halo case above but introduce 
further SZ terms. As in the case of the radio-source modelling, 
to avoid label switching, we introduce an ordering condition on 
the centroid coordinates of the SZ components. This is applied 
first to the right ascension parameters and then to the declination 
direction, to test against any bias potentially introduced by the 
specific prior choice. Nevertheless, the results are found to be 
consistent between the two modelling sets. 


Independently of the model used to describe the underlying 
pressure distribution, the sampler identifies a secondary SZ fea- 
ture 27.3'34” southeast of the Spiderweb galaxy, corresponding 
to approximately 226 kpc at the protocluster redshift. This falls 
right at the 159) boundary of the main SZ component, imply- 
ing that this secondary feature, if real, would be associated with 
a halo distinct from the one in which the Spiderweb galaxy is 
embedded. The actual existence of such a structure however 
cannot be firmly assessed. The images produced with the high- 
resolution algorithm (see the “Sparse imaging” section below) or 
after subtracting the radio-source model from the low-resolution 
set do not provide any clear evidence for any off-centre SZ struc- 
ture. A lack of spatial correspondence is also noted with respect 
to the protocluster members, as the secondary SZ component 
cannot be clearly associated with any specific concentration of 
member galaxies, indicative of a distinct collapsed halo. The 
absence of a correspondence with protocluster galaxies further 
limits (if not excludes) the chances for the SZ component to be 
associated with the secondary velocity peak mentioned above. 
Above all this, the Bayesian evidence of the model comprising 
two SZ component improves upon the one-component case by 
a factor of only (log Zn,,=2 — log Zn,,=1) S 1.84, corresponding to 
an effective significance of Ge S 1.90. As such, the ALMA+ACA 
data available at present are not able to support the unequivo- 
cal, statistically significant identification of a secondary pressure 
component. 


In fact, increasing the flexibility of the SZ model beyond the 
two-halo scenario does not provide any effective improvements 
in the overall reconstruction. In particular, the inclusion of a third 
component induces the Bayesian evidence to degrade, with a 
reduction with respect to the two-component case of (log Zy,,-3— 
log Zn,,=2) S —1.67 and a limited improvement with respect to 
the reference single-halo model (log Zn,,=3 — log Zn,,=1) S 0.17 
(which converts to Ge < 0.60). At the same time, we observe 
that allowing for an ellipsoidal pressure distribution provides 
a marked improvement in the significance of the model (Ge S 
4.41). However, this concurrently makes the sampling converge 
to a hardly physical solution, with a plane-of-sky eccentricity 
e = 0.96197 — that is, corresponding to a plane-of-sky minor 
axis being only 4% of the respective major axis. This is mainly 
a consequence of the strong degeneracy between the mass (that 
is, the parameter controlling the overall amplitude and scale 
radius of the SZ signal) and the line-of-sight extent of the ICM 
distribution, governed by the eccentricity parameter €. In fact, 
SZ data alone cannot provide information on the line-of-sight 
distribution of the optically thin ICM. We thus have to force 
the line-of-sight scale radius to be equal to the geometric mean 
of the major and minor axes of the three-dimensional ellipsoid, 
assumed for simplicity to lie on the plane of the sky. 

Overall, the main consequence for the main SZ detection with 
data available at present is that a single spherically symmetric 
halo is sufficient to provide a statistically exhaustive description 


of the SZ footprint of the Spiderweb protocluster. The result of 
the elliptical modelling can only be interpreted as a marginal 
indication of an underlying morphological complexity, without 
however providing a conclusive and meaningful answer on the 
spatial properties of the SZ signal. At the same time, as demon- 
strated for the radio-source model, any resolvable asymmetry 
should be naturally traced by an ordered multicomponent model. 
Gaining a better understanding of the morphological properties 
of the forming ICM would require achieving improved quality 
from the observational side, both in terms of sensitivity and fre- 
quency coverage. Most importantly, though, the results reported 
above suggest that the robust identification of the SZ signal al- 
ready with the simple spherical model provides only a lower 
limit to the actual significance of the detection and that this could 
only be enhanced when including in our models the description 
for any irregular and asymmetric features. 


Dependence of the SZ significance on the number of point-like 
components. First, we note that we do not observe a substan- 
tial variation in the fluxes of the compact components between 
the modelling runs with and without a SZ component. This 
provides a straightforward test of the robustness of our model 
reconstruction, in particular assuring against being driven in the 
SZ identification by the oversubtraction of the radio source. To 
properly assess whether the significance of the SZ signal is, how- 
ever, dependent on the specific assumption on the number of 
point-like components, we rerun the SZ modelling for the entire 
sample of model setups considered for finding the optimal set 
of compact sources. A summary of such a test for our reference 
model (A10 UP) is provided in Extended Data Fig. 3. We note that 
we consider here only the case n > 4, as this is found to represent 
the minimal condition for observing a sensible improvement in 
the image-space residuals and for the sampler not to suffer from 
slow convergence. 

The first outcome is that increasing the number of point-like 
components beyond our reference model (n = 8) induces a drop 
in the Bayesian evidence, thus not justifying any further exten- 
sion of the radio source to n > 9 (second panel). This implies 
that, despite all the parameters remaining practically unvaried 
beyond the optimal set with n = 8, increasing the number of 
point-like sources to n > 9 makes the modelling incur in data 
overfitting. On the other hand, before n = 6, the models ex- 
hibit a rapid increase in the overall significance in comparison 
with the null case, that is, the data-only run with no model com- 
ponents except for the cross-data calibration parameters (first 
panel). Concurrently, for n < 5 the right ascension and decli- 
nation of the SZ centroid (fourth and fifth panels) are observed 
to roughly collapse on the values inferred for the secondary SZ 
feature identified at low significance in the multicomponent run 
discussed above. Corresponding to a region with low primary- 
beam amplitude (< 0.50, depending on the specific array and 
band), this is compensated by an abrupt increase in Mso when 
moving to n < 5 (sixth panel), in turn resulting in a more ex- 
tended (and thus more severely filtered) SZ signal. Despite be- 
ing naively favoured by statistical reasoning on o¢f (third panel), 
we note that the overall effective significance of the SZ models 
for n < 5 is, however, substantially lower than the stable n > 6 
cases (first panel), thus limiting the validity of this reconstruc- 
tion. Further, the cases n < 5 correspond to radio-source models 
that substantially underfit the data and fail in describing the com- 
plex morphology of the extended emission from the Spiderweb 
galaxy (for this, we refer to Extended Data Fig. 4). As such, the 
identification of such an offset SZ feature cannot be reliably asso- 
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Extended Data Fig. 3 | Impact of the number n of compact sources on the model 
reconstruction. From top to bottom, denoting with Z the Bayesian evidence for 
each model: effective significance ox = sgn(Z% — Z%) - (2|Z% — ZĂI)! with respect 
to the null, data-only run (n = 0) of the models comprising only the radio-source 
component (x = pts) and the one including a SZ term (x = pts + sz); variation 
of the effective significance Aox = sgn(Z% — Z¥_1)- (ZX — Z%_1|)!? in the radio 
source-only and full models due to the increment in the number of point-like 
components; effective significance off = sgn(ZP*s? — ZPts) . (2| Zpts+sz — TPs] 1/2 
of the SZ component with respect to the respective radio-only model; deviation in 
right ascension (ARA) and declination (ADec) from the coordinates of the Spiderweb 
galaxy (11'40™48834,-26°29’08//55) as for Extended Data Table 2; mass parameter 
Mso; total flux of the radio model. After a swift evolution in the model evidence 
and parameters for n < 5 (grey area), the model nearly stabilise around the reference 
solution (that is, for n = 8, marked as a black square in all panels) before incurring 
ina degradation of its statistical significance at n > 9 (as clearly shown in the second 
panel by the negative AoE atn = 9). We finally note that, despite being both plotted, 


the terms oP**” and Ao®e"* are not visible because, on the scales considered in the 
first two panels, they practically overlap with and are indistinguishable from the 
respective radio-source points o? and Ao. The error bars for each point denote 
the amplitude of the 68% (~ 1c) credible interval for the corresponding posterior 
distribution. 


ciated with the actual presence of any physical component and 
might be induced by spurious systematic features in the visibility 
data. 

Despite overall supporting the effectiveness of our modelling 
and choice for the reference model, the above results clearly 
highlight a marginal level of variance introduced by the different 
assumptions on the number of compact sources. To take this into 
account and to limit any bias consequent to the choice of a specific 
model as reference, we thus decide to use Bayesian Model Aver- 
aging” to generate model-marginalized profiles when compar- 
ing our reconstruction with observations (see Fig. 1). In practice, 
we computed cross-model posterior probability distributions by 
applying a weighted average to the original model posteriors, 
with the weight of each sampling point set equal to the respec- 


tive Bayesian evidence. To fully account for any degeneracies 
between different model parameters, we generate radio and SZ 
uv models for each posterior sample and each pressure model, 
and then apply the Bayesian model averaging reduction to the 
resulting collection of uv profiles. 


Systematics in the Bayesian analysis. We note that, because 
the dynamical mass estimates are computed assuming virial 
equilibrium for the entire structure, their values represent up- 
per limits to the real mass of the Spiderweb protocluster. On 
the other hand, our SZ-derived mass estimates might be affected 
by non-trivial systematic biases associated with the assumptions 
used in our modelling. 

First, the conversion between SZ signal and total mass is de- 
rived under hydrostatic equilibrium considerations. Given the 
disturbed nature of a galaxy protocluster, we can instead expect 
notable non-thermal contributions to the overall pressure sup- 
port to the ICM — ranging from turbulent motion to dynamical 
effects caused by recent or continuing merger events. At the same 
time, hydrodynamical simulations”? show that the growth of 
the mass of a system resulting from a merger event does not 
correspond to a concomitant increase of the thermal SZ signal, 
owing to the temporal offset between mass evolution and gas 
thermalization. If the Spiderweb protocluster is observed while 
experiencing a merging process, the intrinsic SZ signal would 
thus be lower than expected from standard mass-observable scal- 
ing relations. 

Second, the reconstruction of the SZ signal relies on the as- 
sumption of self-similarity across cosmic time of the halo proper- 
ties. Although this is observed® to hold on large scales for galaxy 
clusters even up toz ~ 1.9, we still lack an exhaustive description 
of the average thermodynamic properties for the ICM within pro- 
tocluster complexes, as well as any observational information for 
the potential self-similar appearance of protocluster halos with 
the ones in their z < 2 descendants. Also, for the sake of compu- 
tational feasibility, we rely on the relatively strong assumption 
of spherically symmetric distribution for the electron pressure 
within the ICM. Nevertheless, the overall depth of the available 
data limits the possibility of obtaining better constraints on the 
SZ effect from the Spiderweb complex or achieving an improved 
separation of the signals from the radio source and the under- 
lying SZ effect able to highlight any diffuse ICM halo with low 
surface brightness. Similarly, asymmetries in the morphology 
of the pressure distribution and, therefore, in the resulting SZ 
distribution, as well as to any secondary ICM components pop- 
ulating the Spiderweb complex, cannot be firmly assessed (see, 
however, “Multiple SZ components” above for a discussion). 

Third, given the limited information across the millime- 
tre/submillimetre window with sufficient sensitivity to constrain 
the SZ spectrum, we are not able to disentangle any contribution 
to the overall SZ signal in the direction of the Spiderweb proto- 
cluster from the kinetic term. Similarly, we cannot constrain the 
relativistic correction to the SZ spectrum. Such effect is however 
generally subdominant at the virial temperature expected for 
such a low-mass system (for example, the relativistic correction 
to the thermal SZ effect at 2 keV is of the order of 1.2%; we refer 
to Mroczkowski et al. for a review of the different contributions 
to the SZ effect). Therefore, we assume the measured SZ signal 
to be entirely caused by its non-relativistic thermal component. 


Sparse imaging 
The standard tools available at present for performing radio- 
interferometric imaging and deconvolution are not optimally 
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suited to the joint reconstruction of the signal from an extended 
radio source superimposed over a diffuse SZ decrement. Cross- 
contamination may in fact cause both an underestimation of the 
flux of the former when using cLEAN-like algorithms®!, and, at 
the same time, introduce a notable suppression of the under- 
lying SZ footprint. Also, common approaches exploiting scale 
separation between the signature from the radio sources and the 
ICM in galaxy clusters would not be able to provide a robust 
characterization of the two signals, as their characteristic scales 
exhibit a broad overlap in the case of the Spiderweb protocluster. 

Building on the extensive literature on compressed sensing** * 
and sparsity-based component separation®”** and imaging”, 
we thus developed an algorithm for taking full advantage of both 
the different and, in the case of the SZ effect, well-constrained 
spectral behaviour of the measured signals, as well as the infor- 
mation on the different spatial-correlation properties. In particu- 
lar, we assume the total surface brightness I, in a given direction 
on the plane of sky (x,y) and at a given frequency v to be de- 
scribed as 


(1) 


Here, Irs(x, y) is the surface brightness of the radio source com- 
puted at the reference frequency vo, while a(x, y) is the corre- 
sponding spatially varying spectral index. The terms I(x, y) 
and g3,(v) denote, instead, the amplitude in units of Compton 
y and spectral scaling for the thermal SZ effect??. Because, as 
already discussed above, we are not including in our SZ model 
any corrections owing to relativistic terms, the spectral SZ scal- 
ing is determined solely by the frequency and therefore does not 
contribute to the overall model through any specific free param- 
eter. To achieve a high-fidelity reconstruction of the complex 
morphology of the structure of the radio jet from the Spiderweb 
galaxy, we model I,,,(x, y) and a(x, y) as a collection of pixels with 
angular scale matched to the longest baseline according to the 
Nyquist sampling theorem (given the maximum uv distance of 
Umax ~ 1.3 MA, we set the pixel scale equal to 0.5/Umax ~ 0.07”). 
On the other hand, the signal-to-noise ratio of the SZ effect from 
the Spiderweb complex is not high enough to allow for adopting 
an analogous approach to constrain the SZ decrement. To exploit 
the expected large-scale correlation, we hence describe I.,(x, y) 
through an isothermal, circular f model’’, with free centroid co- 
ordinates, amplitude normalization and core radius, whereas we 
arbitrarily fix 6 to a value of 1 (commonly used in ICM studies 
and adopted, for example, by the SPT collaboration as the base 
for their matched-filter cluster template*”). We tested against 
potential biases introduced by the specific choice of $ but did not 
observe a dependence of the solution on this parameter. Clearly, 
this model is less refined than the gNFW profile broadly used in 
the literature and in the low-resolution analysis presented above. 
Nevertheless, here, we are mostly interested in capturing the av- 
erage properties of the faint SZ distribution, while providing a 
good estimate of the base level on top of which to measure the 
surface brightness of the radio structure and avoid its undersub- 
traction when imaging the SZ effect. 

Using a non-parametric approach for reconstructing the ra- 
dio jet has the clear advantage of allowing for constraining 
the signal on the finest scales accessible in the ALMA+ACA 
measurements. However, this causes the model to comprise 
O(10*) free parameters, making the inference problem intractable 
through posterior sampling methods (for example, Markov 
chain Monte Carlo, nested sampling). At the same time, solv- 
ing simultaneously for all the free parameters in the model 
of Eq. 1 through an optimization scheme would imply fac- 


T(x, y) = Tes, y) - (v/v)? + Tsz(X, y) ` Ssz(v) 
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Extended Data Fig. 4 | Variation of image-space residuals with respect to the number of point-like components used to model the extended radio emission. 
Consistently with Extended Data Fig. 1, the images in all panels are generated considering only the data at uv distances smaller than 65 kA. For the sake of simplicity, we 
did not apply any deconvolution procedure to the above images. In fact, the large-scale patterns observed in the top-left panel (n = 4) reflect the non-uniform response 


pattern of the available visibility data. 


ing the shortcomings of a non-convex likelihood function. We 
hence use an iterative, block-coordinate optimization process” 
with proximal point update’. In this scheme, we alter- 
natively solve at a time only for one set of parameters — 
namely, the radio-source amplitude, its spatially varying spec- 
tral index and the SZ centroid coordinates, normalization and 
core radius — through a box-constrained limited-memory Broy- 
den—Fletcher-Goldfarb-Shanno algorithm”. We use the im- 
plementation provided in the scrpy’” package and assume all 
the standard values for controlling the convergence of the algo- 
rithm. The selection of the specific coordinate set to solve for 
is performed by means of the Gauss-Southwell rule!13, which 
introduces an ordering in the optimization blocks based on the 
norm of the Jacobian of the corresponding likelihood function. 
When modelling the radio-source amplitude, we consider a 
Lasso-like regression! step in order to promote sparsity, 
1 
Lins = 2 (2) 
where 72, is computed when fixing all the model parameters but 
Iks, while the constant A, controls the strength of the ¢;-norm 
regularization. To mitigate the bias and amplitude truncation” 
introduced by the ¢; norm, we further iterate over the full block- 
coordinate loop after reweighting the regularization constant ac- 
cording to the previous solution for the radio source. At every 
major step k of the optimization algorithm, we scale the regular- 
ization hyper-parameter as 


a + Ae, IZeslle, 


ek-D ea 


A® = 
e+ EM, " 


fy 


(3) 


where e“ is an auxiliary coefficient aimed at ensuring numer- 
ical stability. We assume this to decrease with the number of 
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reweighting iteration, e = max (e10~, o,;x), with e® given by 
the standard deviation per pixel of the signal as measured di- 
rectly from the dirty image, and o is the RMS of the image-space 
noise per pixel”. Similarly, we can express the initial value AS 
in units of opi, as’ a9 = NOpixd;W;, Where w; are the weights of 
the individual visibilities and n is a user-defined quantity con- 
trolling the overall depth of the modelling process (that is, the 
optimization algorithm neglects any image-space features with 
significance lower than about no,j.). Each block-coordinate opti- 
mization loop is then run until the relative variation in the value 
of the likelihood function between two full sets of updates falls 
below an arbitrary tolerance of 1078. The same tolerance on the 
relative change of the likelihood function between two consecu- 
tive solutions is further used for interrupting the iteration over 
the reweighting steps. 

For the starting values for the model parameters, we consider 
® to be null everywhere and set a® to the spectral index of 
-0.70 typical for synchrotron emission. The coordinates of the SZ 
distribution are initially set on the phase centre of the Band 3 
ALMA observations, roughly coinciding with the Spiderweb 
galaxy, whereas the initial amplitude and scale radius are ini- 
tially set equal to zero and to an arbitrary scale of 30”, respec- 
tively. We tested for such choices by injecting random values for 
each parameter block at the beginning of the optimization run 
and found no substantial deviations in the final results. 


In our analysis, we attempted to impose an extra smoothing 
to the radio-source amplitude through a total squared variation 
regularization!” , found to guarantee, in astronomical imaging, 
better results than the sole f;-norm or standard isotropic total 
variation. However, we observed only a minor impact on the 
final solution and hence decided not to include any total squared 
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Extended Data Fig. 5 | Extended radio source from the sparse modelling. Shown 
in the background is the maximum-a-posteriori model of the surface brightness for 
the radio source associated with the Spiderweb galaxy obtained from the sparse- 
image reconstruction discussed in Methods. The white contours denote the radio 
emission as measured from the VLA X-band!°1! data (see also Fig. 3). To facilitate 
the comparison, we smoothed the sparse ALMA+ACA model to the same angular 
resolution as that of the VLA data. Except for a limited number of isolated and 
low-significance pixels, it is possible to observe a good agreement between the 
morphology of the reconstructed source and the VLA observation. 


variation term in the likelihood function. Similarly, the lim- 
ited significance of the available data did not allow for success- 
fully exploiting any wavelet-based algorithms, generally demon- 
strated §?106108-110 to provide an improved performance compared 
with simple image-based techniques. 

In our analysis, we adopt a conservative threshold n = 4. In 
our tests, a lower parameter showed improved subtraction capa- 
bilities but also deteriorated purity in the solution. As an internal 
cross-check, we further tested against the recovered flux of the 
radio source, finding a maximum-a-posteriori estimate from the 
sparse modelling of 2.606 mJy against the value 2.602+8:008 mJy 
inferred using the nested sampling approach. Finally, the recon- 
structed source manifests a morphology that matches closely the 
signal measured by the VLA in X-band (Extended Data Fig. 5). 


Insights from multiwavelength information 


The combination of the ALMA+ACA data with the wealth of the 
multiwavelength information available on the Spiderweb galaxy 
allows us to propose several potential scenarios that could ex- 
plain the intensity and morphology of the SZ signal and the 
associated offset with respect to the Spiderweb galaxy itself. Al- 
though mild spatial deviations between ICM halos and central 
galaxies are not a surprise even in moderately disturbed systems, 
the identification of such a displacement can inform our obser- 
vational understanding of the dynamical state of a structure (see 
also “Comparison with simulated protoclusters” below). Never- 
theless, the available ALMA+ACA observations are not allowing 
to gain any insights on the physical and thermodynamic state of 
the forming ICM beyond the SZ detection — for example, the 
fraction of the total ICM reservoir that has already thermalized, 
the overall virialization state, the relative contribution of large- 
scale accretion, mergers and AGN feedback in heating the as- 
sembling ICM, the multiscale morphology of the proto-ICM and 
its connection (or absence thereof) with the physical properties of 
protocluster galaxies. Obtaining a robust characterization of the 
intertwined processes determining the complex physical state of 
the forming ICM will thus necessarily require to further improve 
the observational coverage of the protocluster. 

Given the inherently disturbed nature of the Spiderweb pro- 
tocluster, the system could be characterized by marked contri- 
butions from the kinetic SZ effect in the direction of intracluster 
substructures with non-negligible relative velocity with respect 
to the average bulk motion of the whole system. In particu- 
lar, several studies'”! report that the protocluster galaxies in 
the region southwest of the Spiderweb galaxy exhibit a relative 
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blueshift compared with that of the central galaxy. This was 
interpreted as evidence for the presence of an approaching back- 
ground feature, potentially tracing a filament infalling onto the 
protocluster core. Assuming that such structure hosts a nascent 
intracluster gas, this would thus be associated with a reduction 
of the total SZ flux owing to the infilling of the thermal SZ signal 
by a kinetic SZ contribution. An approaching ICM component 
with the same mass as that corresponding to the observed SZ 
feature would require, to completely cancel out its own ther- 
mal SZ effect, a line-of-sight velocity of vlos = —1928*83} km s7! 
when assuming the temperature estimate Te = 2.007973 keV from 
the Chandra X-ray data”. Alternatively, assuming the empirical 
mass-temperature scaling by Lovisari et al.'’? and the resulting 
temperature of Te = 0.85132 keV, the required velocity would 
be Utos = —7964322 kms !. Notably, the former velocity value 
roughly corresponds to the largest velocity reported by Miley 
et al., whereas both estimates are consistent with the aver- 
age velocity for both the Lya-emitting satellite members”! or the 
blueshifted components!® measured by the Spectrograph for IN- 
tegral Field Observations in the Near Infrared''? (SINFONI) on 
the Very Large Telescope (VLT). The same observations further 
show the presence of a small overdensity of redshifted members 
in the region immediately northeast of the Spiderweb galaxy. 
Similarly, the asymmetry in the SZ effect with respect to the cen- 
tral galaxy could be alternatively caused by an enhancement of 
the total SZ decrement owing to a negative kinetic contribution. 


Although a notable contribution of the kinetic SZ effect can 
thus explain some of the multifrequency observations, we argue 
that this is probably not the only process responsible for the ob- 
served asymmetry. In fact, if the kinetic SZ effect were solely 
responsible for the skewed SZ decrement in the direction of the 
Spiderweb core, the X-ray emission should not be affected. We 
do instead observe a marginal asymmetry between the extended 
halo and the Spiderweb galaxy also in the Chandra X-ray data of 
the Spiderweb field, from which the bulk of the extended ther- 
mal emission is reported” to have a mild shift towards the 
northeast with respect to the X-ray signal from the central AGN 
(central panel of Extended Data Fig. 6; we refer to a forthcom- 
ing, dedicated paper! for an extended description of the X-ray 
data processing). However, a further suppression of the over- 
all X-ray signal is observed in the region southwest of the main 
core — analogous to the SZ case — hence suggesting a local de- 
pletion in the forming intracluster halo. Recently, Anderson et 
al. and Carilli et al.” reported direct evidence for the interac- 
tion of the extended radio jets with the Spiderweb environment, 
which could potentially result in the inflation of cavities in the 
diffuse protocluster gas. As commonly observed in the cores of 
local cool-core clusters, the presence of buoyant bubbles would 
induce a reduction of both the X-ray and SZ signals over the lim- 
ited regions covered by the cavities — provided that the bulk of 
their pressure is because of non-thermal components''® — and 
thus contribute to the observed asymmetry in the ICM emission. 


Although potentially important, none of the processes dis- 
cussed above seem to effectively explain why the proto-ICM 
centroid is offset with respect to the Spiderweb galaxy (Fig. 3), 
and the centres of the Lya halo (right panel of Extended Data 
Fig. 6) and of the Ha nebula” in which the Spiderweb galaxy 
is embedded. In a relatively relaxed system, the central galaxy 
should in fact sit deep in the centre of the gravitational well of 
the protocluster complex, along with most of the gas undergoing 
virialization, whereas the warm gas nebula is expected'0"'478 to 
be embedded in an extended halo of hot plasma. However, the 
lack of strong evidence for dominant SZ or X-ray signatures from 
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Extended Data Fig. 6 | Diffuse gas around the Spiderweb galaxy. The three panels show a comparison between the SZ effect (left) observed in the ALMA+ACA data with 
the X-ray emission measured by Chandra (centre) and the VLT/FORS1 image?! 3®111 of the extended Lya nebula (right) within the Spiderweb complex. The ALMA+ACA 
SZ data are imaged using a natural weighting scheme after subtracting the sparse-modelling solution for the radio source. To reduce the dynamic range of the X-ray map, 
we suppress the strong emission from the central AGN by fitting and subtracting a point-like model to the data. The hatched region denotes the signal dominated by the 
inverse Compton X-ray emission associated with AGN or the radio jet'®!!” rather than the thermal bremsstrahlung from diffuse hot gas. For reference, we include in 
dark red the same VLA contours as in Extended Data Fig. 5. When discussing the extended X-ray signal from the ICM, we will then refer only to the region outside this 
mask (that is, excluding the radio jet). The contours in all the panels trace the decrement in the ALMA+ACA residual image after the application of a 20 kA Gaussian 
uv-taper to enhance the large-scale features associated with the SZ effect. The levels start at 20 and increase by a factor 0.50, with ø = 5.30 Jy beam! as measured from 
the corresponding jackknifed image. We mark with a white triangle the position of the Spiderweb galaxy. The ellipses in the lower-left corner of the left panel denote the 
beams for the ALMA+ACA SZ images at the original (solid edge) and uv-tapered (dotted edge) resolutions. Although the low significance and, in the case of the ALMA 
data, complex effects associated with interferometric filter hamper the possibility of observing an exact correspondence between the SZ and X-ray images, both of these 
manifest an asymmetry towards the northeast in their morphological signatures, suggesting an association of the two signals with a hot feature in the diffuse intracluster 
gas. On the other hand, the Lya halo occupies the region corresponding to the southwest deficiency observed in both the SZ and the X-ray data. This result might imply 
that the very core of the Spiderweb galaxy is dominated by cold and warm gas that has not yet been shock-heated to the virial temperature. 


such a large-scale halo, in combination with the numerous mea- As reported by Tozzi et al., the radio galaxy might be observed 
surements*??16 showing that the central approximately 70 kpc in the midst of the establishment of the radio-mode feedback 
in the Spiderweb complex hosts a massive reservoir of atomic regime and potentially providing a subefficient heating mech- 
and molecular gas, hints at a substantial, possibly predominant anism. We can use the estimates of the non-thermal pressure 
presence in the core region of warm and cold gas phases rather and the volume of the radio-bright regions provided by Carilli et 
than ionized gas at the virial temperature. Emonts et al." fur- al." to roughly evaluate the non-thermal energy provided by the 
ther report the detection of a tail in the CO J=1-0 line emission radio jet, Enn = 6 x 10° erg. Models of jet propagation through 
from the surroundings of the Spiderweb galaxy, interpreted as the ICM suggest that a comparable amount should already 
being associated with preferential accretion of cold gas along be deposited into the shocked intracluster gas. In fact, the jet 
a filament. Notably, such emission is oriented along the same energy Enth is comparable with the ICM thermal energy within 
direction as the asymmetry in the SZ and X-ray morphologies. the ~ 60 kpc sphere encapsulating the radio jets. This means 
At the same time, past observations provide evidence for rela- that the mechanical energy might have a non-negligible impact 
tive motion of the Spiderweb galaxy within its own molecular on the thermodynamics of the gas (and SZ flux) from the same 
halo*"!°, as well as a large-scale systematic asymmetry in the region. However, to effectively affect the thermal energy of the 
velocity distribution of the protocluster members, denoting or- ICM in a few 10!3 Mo cluster up to fso, the jet has to operate 
dered flows of galaxies towards the central galaxy’®*’. Such for an order-of-magnitude longer time. All in all, despite the 
motion could also naturally explain the difference observed in broad evidence™!!”?”1!9 for a major cross-interaction between 
the radio morphology of the two opposite jets from the central the central AGN and the assembling environment, the available 
radio galaxy’”''8 (see Fig. 3). The western lobe is in fact found data are still not allowing for inferring a conclusive answer on 
to be experiencing a stronger interaction with the environment the relative contribution of AGN heating (versus, for example, 
than its eastern counterpart. Although recent studies'®!! based gravitational process as infall and major mergers) to the overall 
on VLA observations of the Spiderweb galaxy satisfactorily ex- thermal energy budget of the system. An investigation of the en- 
plained this behaviour as resulting from the inhomogeneity of ergetics and thermodynamics of the AGN-ICM system will thus 
the density field in the surroundings of the Spiderweb galaxy, be deferred to a future study. 

it is not possible to exclude that the relative motion of the ra- , ae 

dio galaxy within the diffuse intracluster atmosphere is in fact Finally, it is important to note that the correspondence of the 
contributing to the overall ram pressure enhancement. All these dominant features and, in particular, of the lack of thermal sig- 
hints, taken together, suggest that the shift between the bulk dis- nal to the west of the Spiderweb galaxy in both the X-ray and SZ 
tribution of the proto-ICM and the Spiderweb galaxy may be the data (see central panel of Extended Data Fig. 6) exclude that any 
net result of the merging activity experienced by the system, with potential undersubtraction of the signal from the extended radio 
a multiphase subhalo or filament infalling into the cold, compact SOUTCE could have a dominant effect on our results (see, for exam- 
core from the east-northeast and causing a local enhancement of ple, “Nested posterior sampling” above for a collection of tests 


the SZ and X-ray signals through shock or adiabatic compression ©” the model reconstruction). Nevertheless, improvements to 
of the intracluster gas our reconstruction would require a more extensive observational 


coverage, including deeper X-ray and multifrequency SZ mea- 

It is worth mentioning that the clear displacement between surements, and higher angular resolution in particular for the 
the proto-ICM halo and the Spiderweb galaxy might be indica- latter. It is worth noting that the measured integrated Compton 
tive of the subdominant role of AGN feedback in providing the Y parameter — Y(< 5rso) = (1.682933) x 10-°Mpc? — corresponds 
energy support required to heat the assembling intracluster gas. toaSZ flux density of ~ 0.6mJy at 100 GHz. Such measurements 
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are however barely in reach of current-generation subarcminute- 
resolution millimetre-wave facilities. 


Comparison with simulated protoclusters 


The parametric modelling presented in the previous sections has 
the fundamental advantage of providing a straightforward and 
highly informative characterization of the average properties of 
Spiderweb protocluster (for example, the halo mass). Never- 
theless, owing to the many simplifying assumptions required to 
perform the model reconstruction using sensitivity-limited mea- 
surements, the analytic models used in our analysis are not able 
to describe the finer details of the intracluster gas. Furthermore, 
all the models for the ICM pressure profile available in the lit- 
erature at present and explored in this work are calibrated on 
systems on a different mass or redshift range than the Spider- 
web protocluster. In fact, we still lack an analytic description 
derived from observational considerations of the average ther- 
modynamic distributions and, in particular, of the characteristic 
pressure profile for galaxy clusters and protocluster structures at 
z z 2. Finally, because these models provide mean pressure pro- 
files, they cannot capture the diversity and the possibly complex 
morphology of the pressure structure of the ICM in protoclusters. 

The shortcomings of the parametric modelling motivates the 
use of hydrodynamical simulations to perform a more thorough 
exploration of the properties of the SZ signal measured in the 
ALMA+ACA observations of the Spiderweb system. We stress 
again that this analysis has only the scope of presenting a qualita- 
tive comparison with numerical predictions of objects evolved in 
a cosmological environment, without any pretence to identifying 
and analysing an exact numerical equivalent of the Spiderweb 
complex. The simulated protoclusters analysed in this work 
are taken from the “Dianoga” simulation set”°. In particular, 
these comprise five regions that at z = 0 have sizes in the range 
25 — 40 comoving Mpc h!t, selected from a parent 1 h! Gpe 
dark matter-only cosmological volume around five massive ob- 
jects (with Mso at z = 0 in the range (5.636 — 8.811) x 104 Mo h71). 
Such regions are resimulated with the N-body hydrodynami- 
cal code OpenGadget3, at high resolution (with dark matter and 
gas particle masses of 3.4 x 107 h! Mo and 6.1 x 10° h! Mo, 
respectively). Besides hydrodynamics, which is described by 
an improved version!”? of the smoothed-particle hydrodynam- 
ics, these simulations include radiative cooling, star formation, 
chemical enrichment and energy feedback from both supernovae 
and AGN (we refer to Planelles et al. and Bassini et al.” for de- 
tails on previous versions of the simulations used here and to a 
forthcoming paper by Borgani et al.” for this specific sample). 

The high-resolution regions are large enough to resolve several 
halos, some of which — but not all — are progenitors of the main 
clusters selected at z = 0. For the purpose of this comparison, we 
select all systems with a mass Mso 2 1.3 x 10°Mp at z = 2.16, for 
a total sample of 27 distinct objects. SZ maps are generated at the 
corresponding snapshot using the map-making software SMAC!” 
and projected onto the visibility plane. We note that, to simplify 
the comparison, the projected SZ maps are generated consider- 
ing only the thermal component and they do not encompass any 
kinetic SZ term. Despite the obvious discrepancies associated 
with the specific differences in the morphological properties and 
dynamical states of the simulated protoclusters with respect to 
the Spiderweb complex, Fig. 2 shows that the mock uv profiles 
for simulated halos with mass in the same range of the para- 
metric estimates (~ 3 x 10! Mo) are broadly consistent with the 
ALMA+ACA measurements. 
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To further gain visual insight into the simulated protoclus- 
ters, we produce synthetic maps by projecting the simulated 
ALMA+ACA visibilities back into the image space, consistent 
with the imaging-reconstruction process of Extended Data Fig. 6. 
Three out of the full sample of simulated clusters are shown in 
Extended Data Fig. 7, with one specifically selected as exhibit- 
ing a SZ footprint resembling that observed for the Spiderweb 
protocluster, one as the most massive halo in the simulated set 
and another as exhibiting a displacement between the SZ peak 
and the halo centre analogous to what is observed between the 
SZ signal and the central galaxy in the Spiderweb complex. We 
checked that, for the entire sample, the physical displacement 
between the dark-matter halos and the corresponding dominant 
galaxies were never larger than about 6 kpc, more than an order 
of magnitude smaller than the observed SZ-galaxy displacement 
in the Spiderweb protocluster. In the context of the discussion on 
such an offset, the halo centroid and galaxy for each of the simu- 
lated halos can thus be considered to be consistent. To introduce 
realistic noise and optimally reflect the noise properties of the 
available observations, we jackknife the original measurements 
by changing the sign of a randomly selected half of the visibili- 
ties. This noise-like realization is finally added to the idealized 
SZ data. Consistent with the uv result above, we can observe an 
overall agreement in terms of the average amplitude of the SZ 
surface brightness in the reconstructed images between the SZ 
signal measured in the ALMA+ACA data (Extended Data Fig. 6) 
and the estimate for the simulated low-mass system. We note 
that the selection of a specific projection axis has only a minor 
effect on the measured SZ amplitude. Over the whole sample, 
the change in both the peak and the average SZ signal associated 
with a different projection exhibits a RMS variation of about 10%, 
reaching a maximum of ~ 30% in only three cases, corresponding 
to objects that manifest morphological disturbances. 

In any case, what is most important in such a comparison is 
the possibility of gaining a sensible understanding of the effect 
of radio-interferometric filtering on the observed distribution of 
the SZ signal. In the specific case shown in the bottom row of 
Extended Data Fig. 6, we can observe that the displacement be- 
tween the SZ distribution and the centre of the corresponding 
dark-matter halo (in turn roughly coincident with the position 
of the most massive galaxy in the structure; see above) is ex- 
aggerated by the radio-interferometric suppression of the bulk, 
relatively symmetric SZ signal on large scales. The net result is 
an enhancement of local SZ asymmetries that finally drive the 
definition of the observed ICM morphology. In fact, the observed 
offset is not indicative of an absolute displacement between the 
ICM and the Spiderweb galaxy but rather of a positional mis- 
match between the peak of the pressure distribution and the 
radio galaxy, typical of disturbed systems. 

It is clear that, to move this comparison beyond the sole qual- 
itative analysis and to perform a robust cross-characterization 
of the observed and simulated SZ signals, a broader exploration 
of the model parameter space in the available hydrodynamical 
simulations would be required. Such a detailed investigation 
will however be deferred to a future work. 
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Extended Data Fig. 7 | Mock observations of simulated galaxy protoclusters. Compared are the input Compton y maps from the Dianoga simulation set (left), and 
the images reconstructed from the simulated interferometric observations without (centre) and with (right) adding the jackknife-based noise realization. All maps are 
centred on the centroid of the dark-matter halo of each structure (also denoted as dotted crosshairs in the right panels), roughly corresponding (modulo a negligible 
displacement < 6 kpc) to the position of the most massive galaxy in each system. The top row shows the maps for a simulated halo selected to generate a SZ signal with 
amplitude consistent with that measured for the Spiderweb protocluster. For comparison, we show in the middle row the most massive halo in the Dianoga sample. 
The bottom row corresponds instead to a simulated halo exhibiting a notable offset between the centre of the dark-matter halo and the peak in the pressure distribution 
(that is, SZ signal), with an overall displacement similar to what was found for the SZ footprint and the central galaxy in the Spiderweb complex. The asterisk on the 
colour scale label denotes the case in which the model surface brightness map is rescaled to match the signal-to-noise ratio of the ALMA+ACA measurements and, in 
turn, enhance the corresponding mock observation in the right panel. The white square in the left panels denotes the field of view of the mock observation maps in the 
central and right columns. To facilitate the comparison, we set the colour map for the right frames equal to the one used in Extended Data Fig. 6. The solid contours in 
the right panels trace the noiseless mock observations in the central column, with contours starting from 1o and increasing by 1ø steps (for ø = 4.10 uJy beam™!). On 
the one hand, the limited signal-to-noise ratio for any of the halos of the sample highlights the need for deeper observations in view of a proper characterization of the 
morphological and physical properties of the forming ICM. On the other hand, the identification in the Dianoga set of halos with local asymmetries in their pressure 
distribution supports the case for the substantial offset observed in the ALMA+ACA data between the SZ signal and the Spiderweb galaxy not to be representative of 
large-scale displacements but the result of a non-trivial combination of irregular morphology and large-scale interferometric filtering. 
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